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ABSTRACT 

The Excursion Set approach has been used to make predictions for a number of inter- 
esting quantities in studies of nonhnear hierarchical clustering. These include the halo 
mass function, halo merger rates, halo formation times and masses, halo clustering, 
analogous quantities for voids, and the distribution of dark matter counts in randomly 
placed cells. The approach assumes that all these quantities can be mapped to prob- 
lems involving the first crossing distribution of a suitably chosen barrier by random 
walks. Most analytic expressions for these distributions ignore the fact that, although 
different fc-modes in the initial Gaussian field are uncorrelated, this is not true in real 
space: the values of the density field at a given spatial position, when smoothed on 
different real-space scales, are correlated in a nontrivial way. As a result, the problem 
is to estimate first crossing distribution by random walks having correlated rather than 
uncorrelated steps. In 1990, Peacock & Heavens presented a simple approximation for 
the first crossing distribution of a single barrier of constant height by walks with cor- 
related steps. We show that their approximation can be thought of as a correction to 
the distribution associated with what we call smooth completely correlated walks. We 
then use this insight to extend their approach to treat moving barriers, as well as walks 
that are constrained to pass through a certain point before crossing the barrier. For 
the latter, we show that a simple rescaling, inspired by bivariate Gaussian statistics, 
of the unconditional first crossing distribution, accurately describes the conditional 
distribution, independently of the choice of analytical prescription for the former. In 
all cases, comparison with Monte-Carlo solutions of the problem shows reasonably 
good agreement. This represents the first explicit demonstration of the accuracy of an 
analytic treatment of all these aspects of the correlated steps problem. While our main 
focus is on first crossing distributions of deterministic barriers by random walks, in an 
Appendix we also discuss several issues that arise upon introducing a stochasticity in 
the barrier height, a topic which has gained interest recently with regards the mapping 
between first crossing distributions and halo mass functions. 
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1 INTRODUCTION 

The abundance of clusters and its evolution is a useful probe 
of the primordial fluctuation field, the subsequent expansion 
history of the universe, and the nature of gravity. This is, 
in part, because there is an analytic framework for under- 
standing how cluster formation and evolution depends on 
the background cosmological model. Analyses based on the 
assumption that clusters form from a spherical collapse sug- 
gest that a cluster today is a region that is about 200 times 
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the background density, and it formed from the collapse of a 
sufficiently overdense region in the initial conditions (Gunn 
& Gott 1972). Numerical simulations suggest that this ex- 
pectation is reasonably accurate. 

Hence, the problem of estimating cluster abundances 
at any given time reduces to the problem of estimating the 
abundance of sufficiently overdense regions in the initial con- 
ditions (Press & Schechter 1974). However, the overdensity 
associated with a given position in space depends on scale (in 
homogeneous cosmologies, the likely range of overdensities is 
smaller on large scales). So, to estimate cluster abundances, 
the problem is to find those regions in the initial conditions 
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which are sufficiently overdense on a given smoothing scale, 
but not on a larger scale. This is because, if the larger re- 
gion is sufficiently overdense, then, as it pulls itself together 
against the expansion of the background universe and col- 
lapses, it will also squeeze the regions within it to smaller 
and smaller sizes. The framework for not double-counting 
the smaller overdense regions that are embedded in larger 
ones is known as the Excursion Set approach (Epstein 1983; 
Bond et al. 1991; Lacey & Cole 1993). With some care, one 
can build an Excursion Set model for voids as well (Sheth 
& van de Weygaert 2004). 

Within the context of the Excursion Set approach, the 
spherical evolution model exhibits an important technical 
simplification: the critical overdensity 5c required for col- 
lapse at a given time is independent of the mass or size of 
the final object. In fact, neither clusters nor voids are spher- 
ical, and the critical overdensity associated with collapsed 
objects depends on how different from spherical the object 
is. As a result, the critical overdensity required for collapse, 
when averaged over objects of a given mass, becomes mass 
dependent (Sheth, Mo & Tormen 2001). In modified gravity 
models, mass dependence of the critical overdensity appears 
even in spherical evolution models (Martino, Stabenau & 
Sheth 2009; Brax, Rosenfeld & Steer 2010). 

While mass-dependence of 5c does not complicate the 
logical framework of the Excursion Set description, it does 
impact the ability to obtain exact analytic expressions for 
the quantities of interest. Nevertheless, simple accurate ap- 
proximations have been developed (Sheth & Tormen 2002; 
Lam & Sheth 2009). In addition to allowing one to predict 
cluster abundances, these allow one to produce accurate Ex- 
cursion Set models which describe how the probability dis- 
tribution function of mass in randomly placed cells depends 
on cell size (Sheth 1998; Lam & Sheth 2008). In some re- 
spects, clusters can be thought of as cells of vanishingly small 
size, so the excursion set model for the counts-in-cells dis- 
tribution is also a model for the density run around clusters 
(and voids) on scales that are larger than the virial radius 
(or void wall). 

To obtain analytic expressions, all of these analyses as- 
sume that the density field on one scale is trivially correlated 
with that on another scale. If one plots the overdensity as 
a function of smoothing scale, then this resembles a ran- 
dom walk - the usual assumption is that successive steps 
in the walk are independent of the previous ones. This is 
known to be a bad approximation, but because there are no 
known exact solutions to the case of realistic correlations, 
the assumption of uncorrelated steps has been routine. This 
is despite the fact that Peacock & Heavens (1990) showed 
how to derive a reasonably accurate expression for the spher- 
ical collapse problem and correlated steps. However, it has 
received little use, presumably because it is only an approxi- 
mation, whereas the corresponding problem of spherical col- 
lapse with uncorrelated steps was solved exactly shortly af- 
ter their paper appeared (Bond et al. 1991). Bond et al.. 
also described a simple numerical solution to the correlated 
steps problem; it too has received little attention. 

Recently, however, there has been renewed interest in 
the correlated steps problem: Maggiore & Riotto (2010a) 
have introduced field theory techniques to address this prob- 
lem. In essence, this approach aims to solve analytically the 
same path integrals that Bond et al. solved numerically. 



This technical machinery is too complicated to solve exactly, 
but, at the end of the day, it does provide a simple analytic 
approximation for cluster abundances in the spherical col- 
lapse model. There is as yet no similarly simple expression 
for the case in which Sc is mass-dependent (although see De 
Simone, Maggiore & Riotto 2011a for a treatment of corre- 
lations induced by non-Gaussian initial conditions). 

The present paper is motivated by the fact that the 
field theoretic approach yields approximate rather than ex- 
act expressions. So it is interesting to ask how it compares 
to the older Peacock-Heavens approximation. In Section [2l 
we show that the older approximation is, in fact, the more 
accurate of the two, for the case of a constant, determin- 
istic barrier (see below for a discussion of the case when 
the barrier height is stochastic). Therefore, the main goal of 
the current paper is to show how the analysis of Peacock & 
Heavens can be extended to the case of mass-dependent Sc. 
Section shows that this can be done almost trivially. Sec- 
tion[3]discusses an ansatz which relates the shape of the first 
crossing distribution of walks that are constrained to pass 
through a certain point before first crossing the barrier to 
the shape of the unconditional first crossing distribution. We 
combine this with the Peacock-Heavens approach to provide 
a rather accurate approximation of the conditional distribu- 
tion. A final section summarizes and discusses some implica- 
tions. Appendix |X] provides an alternative derivation of the 
Peacock-Heavens ansatz which yields some insight into the 
nature of their approximation, while Appendix iBl contains 
technical details about smoothing windows. Appendix [C] in- 
cludes a discussion of problems in which the barrier height 
is stochastic, an issue which has gained considerable recent 
interest (Maggiore & Riotto 2010b, Corasaniti & Achitouv 
2011b). We argue that these latter treatments correspond 
to making some specific technical choices which are difficult 
to test, and that there are in fact several other (testable) 
options when dealing with stochastic barriers, which remain 
to be explored. 

In a related paper, we address the question of halo bias 
(Paranjape & Sheth 2011), which is associated with a cer- 
tain limit of our solution of the constrained walks problem. 
And in a third, we address the question of voids (Paranjape, 
Lam & Sheth 2011): here, the problem is the generalization 
of the Peacock-Heavens approximation to the case of two ab- 
sorbing barriers rather than just one. This also works rather 
well. All of these analyses assume the initial fluctuation field 
was Gaussian. The case of non-Gaussian initial conditions 
is discussed by Musso & Paranjape (2011). 



2 THE CONSTANT BARRIER PROBLEM 

For what follows, it will be useful to define 

-UR)^ j'^^^^k^'W\kR), (1) 

where P{k) is the power spectrum of initial density fluctu- 
ations, (linearly extrapolated to present epoch) and is a 
smoothing filter. The quantity ^^{R) measures the variance 
in the field on scale R. We will reserve the symbol s (or S) 
to denote the variance, 

s = al{R) . (2) 
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We will also make use of the combination 
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For P{k) oc k" and a Gaussian filter W{kR) — e 
which we will use to illustrate many of our results, aj cx 
^-3-n-2j 72 ^ (3 + n)/(5 + n). 

The Excursion set ansatz relates the abundance of halos 
of mass m to the fraction of random walks which first cross 
a barrier of height 5c on scale s{R), where m = pA-KR^ /Z. If 
f(s) As denotes this fraction, then 



m dn(m) 
p dm 



dm = /(s) ds. 



(4) 



We argue elsewhere that this relation between halo abun- 
dances and the first crossing distribution is not the full story. 
In what follows, we are mainly interested in making accurate 
estimates of the first crossing distribution. 



2.1 Numerical (Monte-Carlo) solution: TopHat 
smoothing and ACDM P(fc) 

Figure [T] shows the result of Monte-Carloing the first cross- 
ing distribution associated with a barrier of constant height 
Sc- In the top panel, the black histogram (the one which has 
more counts at large y = 5^ / s) shows the Monte-Carloed dis- 
tribution for walks with uncorrelated steps. The solid curve 
going through it shows the associated analytic expression 
for the first crossing distribution (equation |6] below) . The 
agreement indicates that the numerical algorithm works. 

The red histogram shows the result when the steps 
are correlated. In practice, we transformed each walk with 
uncorrelated steps into one with correlations by applying 
smoothing filters of different scales following Bond et al. 
(1991). In this case, the correlation depends on the form of 
the filter and on the shape of the initial linear theory power 
spectrum P{k). We used a Tophat smoothing filter and a 
ACDM power spectrum appropriate for (f2m = 0.25, ~ 
1 — Omj/i = 0.7, (T8 = 0.8). We then performed the same 
analysis as for the uncorrelated walks; find and store the 
scale on which 5c is first crossed. Note that now the relation 
between s and M is modified compared to the previous case; 
we have checked that our algorithm does this correctly. 

The dotted curve shows the first crossing distribution 
associated with what we call completed correlated walks be- 
low (equation [8] below), a limiting case that will prove useful 
for understanding many of the results to follow. The short 
and long dashed curves show the Peacock-Heavens approxi- 
mation (equation[l7]below) , and the dot-dashed curve shows 
the approximation from equation 119 of Maggiore & Ri- 
otto (2010a). Both approximations have one free parameter, 
which we have set to the value appropriate for the walks 
shown in Figure [1] For Maggiore-Riotto, this parameter is 
K = 0.45. For Peacock-Heavens, the parameter F is actu- 
ally scale dependent (see Figure |4] below): the short dashed 
curve shows the result of setting F = 4 and ignoring this 
dependence; the long-dashed curve, which provides a better 
description of the numerical solution, includes this depen- 
dence. (For reasons that we discuss later, we calculated F 
for the long-dashed curve assuming a Gaussian rather than 
TopHat fiher.) 

The bottom panel of Figure [1] shows the residuals 
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Figure 1. Top panel: Distribution of the scale s on which walks 
which first cross 5c (histograms) for a ACDM P{k). The his- 
togram which shows more objects at large y = is from walks 
with uncorrelated steps (labelled sharp-fc); the other shows the 
result for walks with correlated steps due to TopHat smoothing. 
Solid curve shows the analytic prediction for uncorrelated steps 
from Bond et al. (1991); dotted curve is this divided by a fac- 
tor of 2, for completely correlated steps; the two dashed curves 
show two implementations of the Peacock & Heavens (1990) ap- 
proximation for a Gaussian filter (see text for details); and the 
dot-dashed curve shows the approximation of Maggiore & Ri- 
otto (2010a). Bottom panel: Red squares show the residuals (with 
Poisson errors) between the walks with correlated steps and the 
scale-dependent implementation of the Peacock- Heavens approx- 
imation. Blue dashed curve shows the relative difference between 
the Maggiore-Riotto and Peacock-Heavens results. For compari- 
son, the black triangles (which were given a small horizontal offset 
for clarity) show the residuals between the walks with uncorre- 
lated steps and the corresponding analytic prediction of Bond et 
al. 



(red squares) with Poisson errors between the TopHat 
CDM walks and the Peacock-Heavens approximation with 
scale-dependent F, and the relative difference between the 
Maggiore-Riotto result and the Peacock-Heavens approxi- 
mation (dashed blue curve). For comparison, we also show 
the residuals between the walks with uncorrelated steps 
and the corresponding analytic prediction from Bond et al. 
(1991) (black triangles). The points for uncorrelated steps 
were given a small horizontal offset for clarity. 
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Figure 2. Distribution of the scale s on which walks which first 
cross Sc (histograms) for P{k) oc k~^'^. The histogram which 
shows more counts at large y = S'^/s is from walks with uncorre- 
lated steps (labelled sharp-fc); the other two histograms show the 
result for walks with correlated steps, associated with TopHat 
and Gaussian smoothing filters. Solid curve shows the analytic 
prediction for uncorrelated steps from Bond et al. (1991); dotted 
curve is this divided by a factor of 2, for completely correlated 
steps; and dashed curve shows the Peacock & Heavens (1990) 
approximation for a Gaussian filter, with T = 4.13. 

This shows that the Peacock-Heavens approximation for 
the first crossing distribution is more accurate than that of 
Maggiore & Riottcf^- In what follows, we will show that it 
is also more easily extended to treat more general power 
spectra, more general smoothing filters, and more general 
barrier crossing problems. (In private communications with 
us, Maggiore & Riotto have emphasized that they do not 
believe that equation 119 of Maggiore & Riotto 2010a can 
be applied directly, without further computation, to the case 
of more general smoothing filters or power spectra.) 

2.2 Power-law P{k) and other smoothing filters 

Figure [2] shows the first crossing distribution of a constant 
barrier when P{k) oc fc~^'^. From top to bottom, the his- 
tograms show results for sharp-fc (black), TopHat (blue), 
and Gaussian (red) smoothing filters. (We note again that 
the relation between s and M is filter dependent; our al- 
gorithm accounts for this correctly.) The similarity of the 
TopHat and Gaussian histograms confirms a point made by 
Bond et al. (1991): When expressed as a function of y, the 
first crossing distribution associated with TopHat smooth- 
ing filters is approximately the same as for Gaussian filters. 
(Since the relation between y and m is filter-dependent, this 



^ The latter shows discrepancies at the level of ^ 15%, in keeping 
with the fact that their treatment is a linearization in k ^ 0.4. 



means that, when expressed as a function of m, the first 
crossing distribution does depend on filter.) 

The similarity of these distributions to those for the 
ACDM power spectrum illustrates another point that was 
implicit in the results of Bond et al: When expressed as a 
function of y, the first crossing distribution is approximately 
independent of power-spectrum. (The choice P{k) cc k~^'^ is 
not special; we find qualitatively similar results when P{k) oc 
k~'^.) Note that, for sharp-fc smoothing, this independence 
of P{k) is exact. 

The solid and dotted curves are the same as in the pre- 
vious Figure; they show the first crossing distributions asso- 
ciated with walks that have uncorrelated (sharp-fc filtering; 
equation |6]) and completely correlated steps (equation |8]). 
Notice that both the Gaussian/TopHat solutions approach 
the dotted curve asymptotically at large y, but lie above 
it at smaller y. We will have more to say about this later. 
The dashed curve shows the Peacock-Heavens approxima- 
tion (equation [17] below) for Gaussian filtering (for which 
r = 4.13). It is in reasonable agreement with the Monte- 
Carlo solution, slightly overshooting at the peak. It hap- 
pens, just coincident ally, to provide a good description of 
the TopHat case. This shows that the Peacock-Heavens ap- 
proximation is rather accurate for a wide variety of power 
spectra and smoothing filters. (We have checked that the 
relative differences between this approximation and that of 
Maggiore & Riotto are approximately independent of P{k) 
and filter - indicating that the latter may have wider appli- 
cability, as it stands, than simply to TopHat smoothing of a 
ACDM power spectrum.) 

Having motivated why the Peacock-Heavens approxi- 
mation is so interesting, we now discuss why it works, be- 
fore using the insight gained to extend the approach to other 
barriers and barrier crossing problems. 

2.3 Analytic approximation: Completely 
correlated steps 

Walks with uncorrelated steps are jagged and stochastic. For 
such walks, the first crossing distribution is 

/„(s)ds = = -|-erf(5,/^) (5) 

OS OS 

= ; e " — (6) 

(Chandrasekhar 1943; Bond et al. 1991), where Puis) is the 
"survival probability" that a randomly chosen walk has not 
crossed the barrier Sc prior to s. 

Walks with correlated steps are smoother. However, be- 
fore stating the Peacock-Heavens approximation for walks 
with correlated steps, we believe it is useful to study an- 
other case which can be solved exactly and which is in some 
sense the opposite of the uncorrelated steps problem. In this 
subsection, we will be interested in the limit in which the 
walks are as smooth and deterministic as possible. In par- 
ticular, we would like to think of such walks as having com- 
pletely correlated steps, where by complete correlation we 
mean that the height of the walk at one time completely 
specifies its value at all other times. 

This notion of a completely correlated walk is, at first 
sight, somewhat ambiguous, since a walk which has the same 
height 5 at all s could be said to completely correlated, but 
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a walk which is defined by a straight line from the origin 
through the point in question {6, s), or indeed, any curve /(s) 
whose value is completely specified by the pair (S, s), would 
also be completely correlated. However, if we now require 
that the ensemble of such completely correlated walks also 
satisfy the constraint that the fraction of walks which lie 
above 5 on scale s equals erfc((5/\/2s)/2, i.e., obeys Gaussian 
statistics, then it must be that the number which specifies 
each member of this ensemble is v = S/^/s. Thus, & v = 1 
walk is one which has height 5 = -^s on scale a\ a. u = 2 
walk is one which has height on scale s, etc. Notice 

that these are walks which are not constant height 5, but 
constant v: their height scales as yfs. We will refer to this 
family of walks as having completely correlated steps. 

Now consider a barrier of constant height 5c- The first 
crossing distribution associated with this family of walks 
will depend on the distribution of v. If this distribution is 
Gaussian, piv) = exp(— !/^/2)/\/27r, then the corresponding 
survival probability is 



As a result, 



s/c(s) = 



dPcjs) 
dins 



l+erf(5c/\/2s; 



ldPu{s) 
2 



sfu{s) 



(7) 



(8) 



91ns 2 ' 

this differs from the case of completely uncorrelated steps 
by the factor of two in the denominator. The origin of this 
factor is clear: if 5c > 0, then walks having i/ < can never 
cross 5c- For a Gaussian distribution of v, this means half 
the walks never cross 5c- This is a novel way to understand 
just what it is that Press & Schechter (1974) derived: their 
expression describes the first crossing distribution of walks 
having completely correlated steps (in the sense described 
above). 

Before moving on, note that it is trivial to extend this 
analysis to 'moving' barriers, whose height depends mono- 
tonically on s. For barriers B{s) which decrease with s, one 
simply replaces 5c — >■ B{s) in the expression above. 



Pcis] 



= i(l + erf(B(s)/x/2l)) , 
dPc{s) 



dins 

dln{B/^s) B{s) 
dins J2nl 



-B{sY /2s 



(9) 
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For barriers which increase with s, this is slightly more in- 
volved, as shown by Figure [3] which plots 5/y/s vs. s. 

Completely correlated walks correspond to horizontal 
lines on this plot. It is then obvious that for barriers B{s) 
such that B(s)/^/s has a minimum at some s = Scrit, there 
is no crossing of the barrier for s > Sent- (Since B{s) is 
monotonically increasing, B{s)/^/s monotonically increases 
for s > Scrit.) As a result, the survival probability becomes 



A(s) = 



(l + erf[B(s)/^]) , 
(1 + eri[B{Scrii)/^/2S~i\) 



if S < S'crit 
if S > S'crit 

(11) 

and the rate /c(s) is given by equation (|10p until s — S'crit, 
at which point it becomes and stays zero. We will return to 
this point later. 

To end this subsection, we note that an interesting ex- 
ample of an increasing barrier is one which increases as the 
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Figure 3. Completely correlated walks (horizontal lines) in the 
presence of various barriers. The solid curve is a constant bar- 
rier B{s) = 5c- The long-dashed red curves are for linear barriers 
B{s) = 5c(l + I3s/5l), with P = -1 (lower) and /3 = 0.5 (up- 
per), the latter displaying a minimum at s = S'crit = ^c//^- I'l'^ 
short-dashed blue curve is the square-root barrier B{s) = (5c(l -|- 
fjy/s/Sc), which mimics the constant barrier in that B{s)/y/s has 
a minimum at S'crit oo- See text for a discussion. 



square- root of s: 



B{s) = 5c{l + PyG/5c)- 



(12) 



In this case, although completely correlated walks of height 
u < P will never cross B{s), we have Scrit — >■ oo, so the 
complication associated with s > Scrit does not arise. This 
barrier is a convenient approximation to the 'moving bar- 
rier' associated with halos which form from an ellipsoidal 
collapse (Moreno et at 2009). While showing nontrivial s 
dependence, it retains the simplicity of the constant barrier 
(see Figure [3]). The rate of increase of this barrier (~ s^^"^) 
also serves as the dividing line beyond which Scrit takes fi- 
nite values and the complexity of equation pip comes into 
play. Shen et at (2006) argue that this form, with /? < 0, 
is also a convenient approximation to the barrier associated 
with the formation of sheets. 



2.4 Correlated, but not deterministic, steps 

The two limiting cases - of maximally stochastic and com- 
pletely deterministic walks - serve as useful guides for the 
construction of the first crossing distribution when there is 
some, but not complete, correlation between steps. E.g., for 
the case of very weak or very strong correlations, one might 
imagine perturbing around one or another of these solutions. 
We will argue below that the Peacock-Heavens approxima- 
tion may be thought of as perturbing around the case of 
complete correlation. 

We first briefiy restate their approximation, following 
the presentation of it in Bond et al- (1991). The Peacock- 
Heavens approximation derives from noting that the height 
of a walk on scale s is correlated with its height on scales 
that are within sF of it, where F is a parameter that depends 
on details of the filter, and will be defined below. One then 
asserts that the walk can be broken up into independent 
segments of length sF, and requires that the height of the 
walk is below the barrier after each step. It may be helpful 
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to think of this approximation as stating that, of the frac- 
tion of walks that are below after n steps, one must take 
the fraction that were also below after n — 1 steps, and the 
fraction of these that were below after n — 2 steps and so 
on. If we define 



c(< S\s) = I dx-^ ' ' 



2tv 



(13) 



then the Peacock-Heavens ansatz for the survival probability 
in the presence of a single barrier of height 5c is 



PpH(Sn) = C(< Sc\Sn) H ^^\^') 



(14) 



where the spacing between the Si is chosen such that each 
step is approximately independent of the previous ones. One 
then expresses the product as the exponential of a sum over 
logs, and then replaces the sum by an integral. If we define 



and 



then 

s/ph(s) = - 



p(s) = c(< S,\s) (15) 
E^s) = exp ^ Inp(s')) , (16) 
dPpHis) _ d[p{s)Ec{s)] 

(17) 
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p(s) Inp(s) 



= E,{s) i sf4s) - 



where fc was defined in equation ([8]). The Appendix dis- 
cusses why this is only an approximation to the exact solu- 
tion. 

What remains is to determine F. But before doing so, 
note that as F — )■ cxd, then the term in the exponential of 
equation p6p — >■ 0, so Ec itself — >■ 1, and in equation p7p 
/ph(s) — ^ /c(s), the distribution for completely correlated 
walks. Hence, if we view Ec{S) as a series in 1/F, then one 
may think of the Peacock-Heavens approximation as per- 
turbing around the case of smooth completely correlated 
walks. In fact, we can rewrite the survival probability as 



Pph(s) =Pc(s)£c(s): 



(18) 



where Pc(s) was defined in equation (O, and think of Ec{s) 
as a correction to the completely correlated case. On the 
other hand, note that the F — > limit does not reduce to 
the expression for uncorrelated steps, and is in fact not even 
well-defined. 

For Gaussian smoothing filters and generic power 
spectra, the Peacock-Heavens prescription for F is F = 
27r ln(2) [(ctq""!)/'^]' ~ where the (j| were defined in 

equation ([l}. This can be written as 



27rln(2) v/tVO"^. 



(19) 



To see what this implies, suppose that P{k) oc k". Then 
7^ = (3 n)/(5 + n), making F = 27r ln(2) ^(3 n)/2. 
For n = (-1.2,-2), F = (4.13,3.08). For a more reaF 
istic CDM spectrum. Figure |3] shows that F varies with 
scale (approximately logarithmically with s), with simi- 
lar numerical values. But if the filter is a TopHat in real 
space, then some of the integrals appearing in equation (|19p 
may diverge, so one must compute F slightly differently. 
In Appendix B, we show that for power law spectra F = 



Gaussian 
TH 

Rounded TH 



-0.5 



0.5 

lg(y = 8//S) 



1.5 



Figure 4. The Peacock- Heavens correlation length F for a CDM 
power spectrum, as a function of variance s, with three differ- 
ent filters : Gaussian (solid black) with Wq{x) = where 
X = kRc; TopHat (dashed red) with Wtii{x) = (3/x^)[sin(x) — 
a;cos(a;)] where x = kR'Yn; and a rounded TopHat (dotted blue) 
for which WrTHix) = WTu{x)e-'^^^^'>'^''/^°^^ where x = fciJ^TH- 
A given value of s will, in general, correspond to different values 
of Rq, i?TH and R^tu- 



2n ln(2) •\/(n -f l)(n -f 3)/(n — 3), so the Peacock-Heavens 
procedure is well-defined only if —3 < n < — 1. Note that the 
TopHat filter is known to be analytically troublesome with 
power law spectra, with even the variance s being undefined 
for n > 1. The TopHat is well-behaved with the CDM spec- 
trum though, and Figure [4] shows that F is well-defined in 
this case. Nevertheless, as Figures [T] and [2] show, the numer- 
ical results for TopHat filtering are actually better described 
by the analytical approximation for Gaussian filtering. 

The oscillatory behaviour of the Fourier transform of 
the TopHat filter, results in TopHat smoothed walks being 
less correlated than Gaussian smoothed walks. One there- 
fore expects that the value of F for the TopHat filter should 
be smaller than that for the Gaussian. Numerically, since 
the first crossing distributions for the two filters are not 
very different, the difference in values of F should at most 
be a factor of order unity, and this is true for the CDM 
spectrum. The dramatic discrepancy for power law spec- 
tra is most likely caused by the known bad behaviour of 
the TopHat in this case, combined with the fact that the 
Peacock-Heavens procedure is a perturbation around the 
completely correlated case, and therefore performs better 
for Gaussian smoothed walks, which are closer to complete 
correlation than are TopHat walks. 

One possibility for alleviating the problem of the 
TopHat filter with power law spectra is to round the sharp 
edge of the TopHat (of scale Rth) with, say, a Gaussian 
filter of a smaller scale ei?TH (Bond et al. 1991). The result- 
ing F is well-defined for all n > —3, and smaller than the 
Gaussian. Figure |4] shows the result for a CDM spectrum, 
using e = 0.1. In the remainder of the paper, we will use 
the Peacock-Heavens analysis for Gaussian smoothing and 
power law spectra, since this combination is simple, well- 
defined and accurate. The discussion above (and a compar- 
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ison of Figures [T] and [2} shows that, with some care, our 
conclusions can be generalized to more realistic power spec- 
tra and filters. 



3 MOVING BARRIERS 

One of the virtues of the Peacock- Ifeavens approximation is 
that it is easy to see how equation (|18|) should be extended 
to the case when 5c is no longer constant. The logic behind 
writing the correction factor i?c(s) as in equation p6p does 
not depend on the barrier being constant. Therefore, for any 
moving barrier B{s), this suggests setting 

p(s) = ci< B{s)\V^) , (20) 

in equation (|16|l . 

On the other hand, the discussion in Section [2. 31 shows 
that the survival probability for completely correlated walks 
Pc{s) must be handled with care, depending on the nature of 
the moving barrier. For barriers B{s) which are decreasing 
functions of s, we can still make the simple replacement 
Sc — >■ B{s) in equation ([7]). Figure [5] shows that, for barriers 
of the form 

B{s) = 3,(1 + Ps/5l) (21) 

with /? < 0, the resulting expression for the first crossing dis- 
tribution (dashed curve) describes the Monte-Carlo solution 
rather well. We also note that, for this case, the expression 
for completely correlated walks equation (|10|) (dotted curve) 
actually describes the numerical solution more accurately. 
We return to this in the final Discussion section. 

For barriers which increase with s, such that B{s)/^/s 
has a minimum at some s — Scrit, Section [2. 31 shows that we 
must use equation (jlip rather than (0 to calculate the com- 
pletely correlated survival probability Pc(s) to use in equa- 
tion (|18|) . This correctly accounts for walks that never cross 
the barrier. Figure [S] shows that this prescription works well 
in describing the Monte-Carlo solution for a linearly increas- 
ing barrier H2ip with /3 > 0, although there is a discrepancy 
at small y. Notice that in this case, the completely correlated 
expression does not perform well. 



4 CONSTRAINED WALKS 

The Peacock-Heavens approximation can also be extended 
to describe the first crossing distribution of walks which 
are conditioned to pass through some non-zero {S,S). In 
many respects, this problem highlights the difference be- 
tween walks with uncorrelated steps, and those which are 
completely correlated. 

Since a completely correlated walk is specified by a sin- 
gle number, if it is known to have height 5s on scale S, then 
it will have height 5s \^s/S on scale s. I.e., the expected dis- 
tribution of heights on scale s is a delta function, centered 
on 5s \/s/S. On the other hand, for uncorrelated walks, this 
distribution is Gaussian with mean 5s and variance s — S (we 
have assumed s > S). This means that, except for a shift of 
origin, the conditional walk follows the same statistics as the 
unconditioned one. Namely, one simply sets 5c — >■ 5c — 5s and 
s — >■ s — 5* in equation (|6]) for the first crossing distribution. 

Notice that the determinstic walk increases its height by 
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1.2 - Gaussian P = -l 
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Figure 5. Distribution of the scale s on which walks which first 
cross the barrier (5c(l+/3s/(5^) (histograms). The histogram which 
shows more objects at large y = S^/s is from walks with un- 
correlated steps; the other histogram shows the result for walks 
with correlated steps. Solid curve shows the analytic prediction 
for uncorrelated steps from Sheth (1998); dashed curve shows our 
extension of the Peacock & Heavens (1990) approximation for cor- 
related steps. Dotted curve shows the expression for completely 
correlated walks. 



an amount 5s{\/ s / S — 1) whereas stochastic walks have no 
change on average, but some walks will reach large heights 
because the rms increases as Vs — S. The general case will 
lie somewhere in between these two extremes: we might 
generically expect a milder increase in the expected height 
compared to the deterministic case, with a narrower distri- 
bution around the mean compared to the stochastic case. 
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4.1 A scaling ansatz based on bivariate statistics 

To extend the Peacock-Heavens approach to describe con- 
strained walks, we will make an ansatz which we justify 
later. Our ansatz is that the conditional distribution is just a 
rescaled version of the unconditional one, where the scaling 
variable is inspired by bivariate Gaussian statistics. Namely, 
we define 



Sc — rS ■ 



where r = Sx/\ 



sS and 

dk k^Pjk) 
k 27r2 



W{kR,)W{kRs). 



(22) 



(23) 



Notice that the integral above is similar to that which de- 
fines s and 5", the only difference being that here the two 
smoothing filters have different scales. Our ansatz is that, 
when expressed as ^10/(1/10), the conditional first crossing 
distribution will have the same shape as the unconditional 
distribution vfi^u), a point we will return to shortly. 

Equation (|22|l accounts for the fact that if the field is 
constrained to have value 5s on scale S, then its value on 
scale s will be distributed around a mean value of (Sx / S) 5s- 



For uncorrelated steps Sx 

(5c 



5", so s(l— r ) = s—S and hence 



making 



vio = [Oc — 5s)/Vs — S. This corresponds to simply shifting 
the origin of the walk from (0,0) to {5s, S), as expected 
from the previous discussion. Completely correlated walks 
have r = f; in this limit, our expression for vio correctly 
indicates no scatter around a mean value of 5s y^s/S. 

For Gaussian filtering of a power-law spectrum r = 
[2RsRs/iRl + i??)]("+^>/' and S cx 7?"'"+'', 

= [2/(f + (5/s)2/(3+"))]("+3)/2^ ^(^ 

s - S[2/{1 + (5'/s)2/(3+")]{"+3). To get some intuition into 
what this implies, it is helpful to consider the limit Rs ^ Rs- 
In this case, 5* < s, so uio ~ (5c - 2^''+^^^^5s)/Vs - 
This corresponds to a shift of origin that is larger than for 
the uncorrelated case, but much smaller than for the com- 
pletely correlated case, and a variance that is slightly smaller 
than for the uncorrelated case, but much larger than for the 
completely correlated case. The discussion above applies for 
other smoothing filters too, except that the numerical co- 
efficient 2'"+'^'''^ will change. In Appendix B we give some 
details for the TopHat filter. 



4.2 Comparison with Monte-Carlos 

For the Peacock-Heavens approximation, our ansatz means 
that we set 



c(< 5c\s) -> c(< 5c,s\5, S) 



i [f + eH(uio/V2)j 



(24) 



in equation (|15ll when computing equations (|I6(I . (If 8|) 
and HZ]). Figure [6] shows that this works reasonably well. 
The histograms show the first crossing distribution of a bar- 
rier of height 5c by the subset of walks which are conditioned 
to pass through (^Oi^o), for a few choices of 5o, and the 
smooth curves show our extension of the Peacock-Heavens 
approximation. Whereas the qualitative trend is easy to un- 
derstand - walks which start closer to the barrier (i.e. large 
5o) cross it after fewer steps so there are few left to first 
cross at s ^ 5 - our approach does a reasonable job of 
quantifying the effect. 



0.6 



>) 0.4 




5o = VSo 



-0.5 



lg(y = 8//s) 




Figure 6. First crossing distribution of a barrier of height Sc by 
the subset of walks which are conditioned to pass through {So, So), 
for a few choices of 5o (as labelled). Short dashed, solid and long- 
dashed curves show the analytic prediction from our extension 
of the Peacock & Heavens (1990) approximation (equation 1241 in 
equation I17|l , for Gaussian smoothing of a Gaussian field with 
P{k) QC A:-i-2. 



Our ansatz is that the conditional distribution is just a 
rescaled version of the unconditional one. We have tested it 
more directly as follows. Figure [7] shows the result of trans- 
forming each of the first crossing values s into the associated 
I'lo, and then plotting the conditional first crossing distri- 
bution as 2/10/(2/10), where j/10 = ufg. If our ansatz is good, 
then all the curves of the previous Figure should define a sin- 
gle universal curve. At least over the range of j/10 we have 
shown (j/10 > 0.3), and for the range of So we have stud- 



Excursion set with correlated steps 9 



0.4 



0.1 















unconditional ' — ■ — ' 












= 0.075 










So/ 8/ = 


-- 0.25 


=1- 














1 


































I 


i 












0.5 






1 



ig(yio = vio^) 

Figure 7. Conditional first crossing distributions of the previous 
Figure, expressed as a function of the scaling variable yio = i^jq 
(equation I22II . Symbols with error bars show the unconditional 
distribution from Figure [2] 

led {So/S^2 < 0.3 or so) they do. Moreover, this universal 
curve has the same shape as that for unconditional walks - 
which we have shown using symbols with error bars - indi- 
cating that our ansatz is indeed a good one. In particular, 
this indicates that most (if not all) of the discrepancy be- 
tween the smooth curves and Monte-Carlo'd histograms in 
Figure [6] is due to the inaccuracy of the Peacock-Heavens 
approximation for unconditioned walks. 

Our ansatz, which allows one to transform any uncon- 
ditional distribution into a conditional one (the accuracy of 
the resulting curve will be limited by that of the uncon- 
ditional distribution, of course), allows us to provide some 
insight into the recent work of De Simone et al. (2011b), 
who present a path-integral analysis of the constrained walk 
problem. Their equation (B30jf| with Sm (derived us- 
ing the path integral machinery after linearizing in k) is, in 
fact, equivalent to simply replacing u ^ [Sc — So{l + K)]/-ys 
in their expression for the unconditional mass function, and 
then only keeping terms up to linear order in k. Since their 
AC is the S — !> limit of our Sx/S ~ 1, we conclude that 
our simple ansatz of f — > i/io reproduces the detailed path 
integral result in the limit where path integral calculations 
have been performed (i.e. to linear order in k). Moreover, 
when S is small but not zero. Figures [6] and [7] show that 
the same ansatz correctly describes the numerical solution, 
without having to linearize in S'x /S — 1 (which, in any case, 

^ In an earlier version, their expressions B29-B31 contained an 
error, as can be seen by requiring consistency with Ma et al. 2011, 
or by simply demanding that the limit Sm — > 0, (5o — > reproduce 
the Maggiore-Riotto unconditional distribution. The factor 5c — Si, 
should be Sf, in the last line of each of equations B29-B31. The 
error was corrected in a subsequent version of the paper, after it 
was pointed out by us. 



is not small compared to unity). Of course, in our ansatz, 
Sx/S depends on scale, and appears in the denominator of 
1^10 as well. 

4.3 Extensions and generalizations 

Our treatment of the first crossing of 5c by walks which 
are constrained to pass through some {S, S) is easily ex- 
tended in two ways. One is to the 'two-barrier' problem: 
given that the walk first crossed the barrier Sco on scale So, 
what is the probability of first crossing the barrier Sd > Sco 
on scale Si > So? In this case, one uses c(< 5c\s) — )■ 
c(< 5ci,s|<5co,So) for So < s < Si, but c(< <5cls) c(< 
5cO, s|5c0, So) for s < So. It will be interesting to test how 
well this simple extension does, compared to a numerical 
solution. In principle, we could go on to estimate merger 
rates from the limit in which 5co — )■ 5ci (following Lacey 
& Cole 1993). In this limit, however, most walks will cross 
the barrier within a few steps. As a result, most crossings 
will happen before the walk has travelled a distance that is 
of order the correlation length FS, so the Peacock-Heavens 
approximation is no longer expected to work. 

The second generalization is the introduction of more 
constraints on the walks before (or after) they cross Sc. 
Again, to incorporate these, one must simply modify equa- 
tions (|24p and (|22|l appropriately, since the net effect of these 
additional constraints will be to change the mean and vari- 
ance of the typical walk height on scales that are different 
from the constrained scales. 



5 DISCUSSION 

We have presented an analysis of the first crossing distri- 
bution of a moving barrier by random walks having cor- 
related steps - the first analysis to explicitly compare an- 
alytic approximations with numerical (Monte-Carlo) solu- 
tions. For walks with uncorrelated steps, exact solutions are 
only known for a handful of special cases, although a good 
analytic approximation is available for the general case (Lam 

6 Sheth 2009). However, in the limit in which steps are com- 
pletely correlated, so the walks are smooth and determinis- 
tic rather than jagged and stochastic, we showed that this 
solution is straightforward, at least for barriers which are 
monotonic functions of time/scale (Section 12. 3|l . 

For the more general case of some, but not complete, 
correlation between steps. Peacock & Heavens (1990) pro- 
vide a simple approximation for the first crossing distribu- 
tion of a barrier of constant height (equation [17]). We showed 
that their approximation can be thought of as a correction 
to the solution for completely correlated steps (Section l2.4p . 
The correction involves a suitably defined correlation length 
r (equation I19|) . whose inverse can actually be thought of 
as an expansion parameter; complete correlation is the limit 
1/F — ^ 0. While the prescription itself does not involve a 
perturbation in l/F, one can understand that the approach 
works weU (Figure [21) because l/F « 0.2 (Figure [3|. We 
argued that this is also why the approach works better for 
Gaussian than for TopHat smoothed walks - the latter have 
smaller values of F. 

We also showed (Figure [ij that, when using a constant 
barrier height without scatter, the older Peacock-Heavens 
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approximation was more accurate than that of Maggiore 
& Riotto (2010a). This more recent work, based on field- 
theoretic methods, explicitly perturbs around the solution 
for completely uncorrelated steps. In this case, the perturba- 
tion parameter k, which should be ^ 1, is actually of order 
1/2. Since k is larger than one would like, and F is smaller, it 
is certainly interesting to attack the problem of walks with 
correlated steps from both directions. (See, e.g., Corasaniti 
& Achitouv 2011a for a recent extension of the path integral 
approach to moving barriers.) Additionally, as we discuss 
below and in the Appendix, it is also interesting to explore 
the accuracy of these approaches when extended to models 
in which the barrier height shows a scale-dependent scatter 
rather than being deterministic. 

In Section [31 we showed how to extend the Peacock- 
Heavens approximation to handle moving barriers. For bar- 
riers which decrease with time/scale, this extension is re- 
markably simple and remarkably accurate (Figure [S]). For 
barriers which increase sufficiently steeply with time/scale, 
the extension is slightly more complex, but still quite ac- 
curate. We summarize our results here: for a barrier B{s) 
which is monotonic in s, the first crossing distribution is the 
derivative of the survival probability /(s) — ~dsP{s), where 

P{s) ^ P4s)E,(s) . (25) 

Here Pc(s) is the survival probability for completely corre- 
lated walks, given by equation (|ll|l . in which Scrit is finite 
only when the barrier increases faster than ~ ^/s, and in 
particular Scrit — ^ oo for constant and decreasing barriers. 
The correction factor Ec{s) is given by 

E^s) = exp ^ lnc« B{s')\s')^ , (26) 

where c(< 5\s) was defined in equation p3|) . 

Although this extension to moving barriers is relatively 
straightforward - it is far simpler than for walks with uncor- 
related steps, or for the field theoretic approach - our com- 
parison with Monte-Carlo simulations indicates that there 
is room for improvement. In particular. Figure [S] showed 
that, for barriers which decrease sufficiently rapidly with 
s (for linear barriers, this is true for negative enough val- 
ues of P), the solution associated with completely correlated 
walks is an excellent approximation. I.e., the correction fac- 
tor -Ec(s) should simply be set to unity. It is easy to see why 
this happens: for barriers which fall steeply - where steep 
means B{s) changes height by more than ^/s over the cor- 
relation length scale sF - the barrier is almost vertical, and 
so one need not worry about walks which cross the barrier 
more than once. In this sense, the walks behave as though 
they are completely correlated. We are currently investigat- 
ing whether our simple extension of the Peacock-Heavens 
ansatz can be improved to make Ec depend, not just on the 
correlation length scale F, but on how F relates to the scale 
barrier shape. 

In Section IH we described a simple ansatz which al- 
lows one to transform the first crossing distribution of un- 
conditional walks into a rather good estimate of the first 
crossing distribution associated with walks which are con- 
strained to pass through a certain point in the {5, S')-plane 
before crossing the barrier. When applied to the Peacock- 
Heavens approximation, our ansatz for the conditional dis- 
tribution (eauations l24l and l22l) works rather well (Figure[6]). 



As models for the unconditional distribution improve, we 
expect our ansatz to continue to provide a useful approxi- 
mation, because the numerical solution does appear to scale 
as predicted (Figure [7|. 

The simplicity of our approach to the problem of condi- 
tioned walks derives from approximating the problem, which 
potentially involves n-point distributions, to bivariate dis- 
tributions. To impose more than one constraint one must 
simply modify equations H24p and (|22p : this is a straightfor- 
ward generalization that we did not explore further, but is 
clearly a straightforward way to incorporate Assembly bias 
effects of the sort identified by Sheth & Tormen (2004), and 
since studied by a number of authors. 

We were careful to state at the beginning that our pri- 
mary interest is in how well one can describe the first cross- 
ing distribution when steps are correlated. This is because, 
although our results provide increased understanding of halo 
abundances and evolution, a number of issues must be ad- 
dressed before they can be used to provide quantitative con- 
straints on cosmological parameters. 

First, the correlated walk problem is known to under- 
predict the abundances of clusters (Bond et al. 1991). There 
are at least three possible resolutions: i) the spherical model 
for collapse dynamics is wrong - Sc must be smaller, or fac- 
tors other than the initial overdensity matter (following, e.g., 
Sheth et al. 2001 for clusters); ii) there is substantial scatter 
around the actual value of 5c - in which case one must de- 
cide whether the appropriate solution is to make the barriers 
associated with the void-in-void and void-in-cloud problems 
fuzzy or stochastic (following Sheth et al. 2001, or Maggiore 
& Riotto 2010b, respectively, and also see discussion in Ap- 
pendix B of Sheth & Tormen 2002), or whether it is better 
to simply convolve our solution for fixed 5c with a distri- 
bution p( (5c) of values (e.g. Appendix [C)) : iii) in addition to 
accounting for correlated steps, we must account for the fact 
that the appropriate ensemble of walks over which to aver- 
age also contains correlations - in effect, our calculation has 
assumed each walk is a potential halo center, whereas only 
a few walks really are (compare Figures 2 and 3 in Sheth 
et al. 2001). For points ii) and iii) at least, this means that 
the fundamental assumption of the approach, equation Q, 
is incorrect. 

In addition, the solution to the two barrier problem as- 
sociated for walks with uncorrelated steps is known to give a 
better description of halo merger histories than is the one for 
correlated steps (Bond et al. 1991). Thus, even though we 
have shown how to extend the Peacock-Heavens approach so 
as to provide a reasonable description of the random walk 
problem, we will now have to account for at least one of the 
three issues discussed above, point iii) in particular, before 
we can claim to have a better model for halo formation. In 
the present case, this problem is two-fold, since it is easy 
to see that the Peacock-Heavens approximation will break 
when the barrier to be crossed is low, since then the assump- 
tion that the walk can travel a distance sF without crossing 
the barrier is no longer accurate. Further study along these 
lines is in progress. 
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APPENDIX A: APPROXIMATION 

For a random walk with n discrete time steps {si}, the joint 
probability distribution for the sequence of heights {5i} is a 
multivariate Gaussian with covariance matrix dj — (SiSj). 
The probability that, after each of the n steps, the height of 
the walk lies below 5^, is given by: 



P{Sr. 



dSl 



2tt 



dSn 



2^ |detC|i/2 



exp 



dSi 



dSn 



d\i 
2n 



dXn 

"2^ 



exp 



(Al) 



where the superscript T denotes a transpose of an n- 
dimensional vector. Rescaling the variables using 5i ^ Xi — 
Si/^/si; Xi ^ yi = Xiy/si, and defining the normalised co- 

and scaled barrier heights 



variance matrix 



P{s„) = 



dxi ■ 



dXr, 



dyi 
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2tv 



exp 
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(A2) 



In this language, the completely correlated limit is equiva- 
lent to setting Tij = 1 for all since this will result in the 
joint probability distribution being a univariate Gaussian in 
say x\, multiplying a product of Dirac delta's (5o(xj — x\). 
This will lead to precisely the survival probability discussed 
in Section [2.31 

This language also allows us to understand the Peacock- 
Heavens ansatz. One can easily check that the discretized 
survival probability in equation H14[) follows by choosing a 
sequence of A'' < n steps {s'^-*} and setting rij to the block- 
diagonal form 



/ ^(1) 



V 











\ 



(A3) 



with TV blocks {r^^-*} of length (sj — si_i), each correspond- 
ing to a set of completely correlated steps : r'j^' = 1 for 
all (i,j) in the 7*'^ block. The Peacock-Heavens prescrip- 
tion tells us to choose the sequence {s'^-*} to be logarithmi- 
cally equi-spaced with (As)/ ~ Ps'^^ This clarifies why the 
Peacock-Heavens ansatz is only an approximation, since the 
is different. E.g., for Gaussian smooth- 



ing with n = — 1 we have 



= 2, 



-I- Sj 



which is 



of course unity along the diagonal, but falls off gradually 
rather than in sharp jumps. Nevertheless, as the main text 
shows, the subsequent steps in the ansatz (namely, the con- 
tinuum limit and the evaluation of F) lead to a remarkably 
accurate prescription for the first crossing distribution. 



APPENDIX B: 
PARAMETER 



TOPHAT CORRELATION 



The Peacock-Heavens correlation parameter F for an ar- 
bitrary filter and power spectrum, follows from equations 
(3.24) and (3.15b) of Bond et al. (1991). The case of the 
Gaussian filter and generic power spectra was discussed in 
the main text. For a generic filter and power law power spec- 
trum P{k) oc fc", F is given by 



F = 27rln(2) 

' (Jp°° dxx^+^W{xf) (/p°° dxx^+^W'{xf 
[j^ dxx"+m{x)W'{x)f 



-1/2 



(Bl) 



where W{x) — W{kR) is the smoothing filter, and a prime 
denotes a derivative with respect to the argument. For the 
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TopHat we have 
Wix) = -Mx) ■ W'ix) = -(Mx)--Mx)), (B2) 

X X \ X J 

where ^0(2^) = sm{x)/x and ji{x) = (sin(x) — xcos{x))/x^ 
are spherical Bessel functions of the first kind. It is useful to 
define the integrals 

/"OO 

loo = / dxx"^'^jo{x)'^ , 
Jo 

Joi = / dxx'^*^jo{x)ji{x) , 
Jo 

111 = / dxx"ji{xf , (B3) 
Jo 

in terms of which we can write F for the TopHat filter as 

r = 2.1n(2) 'f 7''"' . (B4) 

The integrals in equation (|B3|) do not converge for all values 
of n. In particular, we have 

Joo = 2"" sin(n7r/2)r(ri- 1) [n(n- l)/4] , - 3 < n < -1 , 

Joi = -2""sin(n7r/2)r(ri-l)[(n + l)/2] , -3<n<l, 

/ii =2-"sin(n7r/2)r(rz-l)[(n + l)/(rz-3)] , - 3 < n< 1 , 

(B5) 

where T{z) is the Euler gamma function. 

The Peacock-Heavens correlation parameter is therefore 
defined for —3 < n < —1, and can be simplified to give 

T = 2.H2)^^^^ , -3<n<-l. (B6) 

The TopHat filter also leads to results slightly different from 
the Gaussian, for the conditioned walks discussed in Section 
4, in particular for the form of r. For example, if we set 
x = Rs/Rs < 1 then, for the TopHat filter, r = ^{5-x')/4 
if n = -2, so Sx/S 5/4 as S/s 0. If n = -1, then 
r = {2a;(l + a;^) + (1 - x^ log[(l - x)/{l + x)]}/{Ax''), so 
Sx/S — > 4/3 as S/s — >■ 0. And if n = 0, then the real-space 
TopHat is just like a sharp k-space filter, so Sx/S — > 1. 
Thus, in general, the numerical coefficient is smaller than 
when the filter is Gaussian. 



APPENDIX C: STOCHASTIC BARRIERS 

Treating the problem of halo formation by a single deter- 
ministic barrier is at best a crude approximation. In triaxial 
collapse models, themselves crude approximations, the col- 
lapse barrier is a function of three variables, the joint dis- 
tribution of which depends on scale (Sheth, Mo & Tormen 
2001). In excursion set language, at each step n, one asks 
if S„ > (5ec(e„,p„) (recall that n is monotonically related to 
the smoothing scale). Since and p„ are random numbers 
(they are neither Gaussian distributed, nor independent of 
Sn), one may think of S^c as being a function of n, so the 
problem is now that of a random walk (5„) crossing a barrier 
{Sec) whose height is stochastic function of n. If projected 
onto the (5, S) plane, this translates to scatter in the value 
of the critical density required for collapse and S (e.g. Fig- 
ure Bl in Sheth & Tormen 2002). 



Since the stochasticity of the barrier is related to that 
of e„ and p„, and these have variance proportional to that 
of 5n, one expects the variance of the barrier height to be 
proportional to S. If one further assumes that changes in 
the barrier height are uncorrelated with changes in S, and 
that the steps in the barrier height are drawn from a Gaus- 
sian distribution (even though steps in e and p are not), 
then, for uncorrelated steps, the net effect of the stochastic 
barrier is simply to rescale all variances: S — >■ S/a for some 
a < 1. This is most easily seen by noting that now one asks 
if Sn > Sc + bn, where b„ is a Gaussian number with variance 
Bn = DS„, for some D > 0. Since S and b are both Gaussian 
distributed, requiring Sn — b„ > Sc is the same as requiring 
Qn, a Gaussian number with variance Sn + Bn = Sn{l + D), 
exceed Sc. This rescaling of the variance means that the 
problem is the same as considering when the previous sum 
of n Gaussian numbers first exceeds Sc/\/l + D. This is at- 
tractive because just such a rescaling appears to be necessary 
to reconcile the shape of the first crossing distribution with 
the measured counts of halos in simulations: e.g., Sheth & 
Tormen (1999) suggest that D ~ 0.4. However, the rescal- 
ing of the variance which is most naturally associated with 
triaxial collapse models, 1 -I- D is not as large as the factor 
of 1/0.7 that is needed (Sheth, Mo & Tormen 2001), ff the 
fundamental ansatz of equation Q is correct (we noted in 
the final discussion section of the main text that Figures 2 
and 3 of Sheth et al. 2001 suggest it is not). 

For correlated steps the issue is more complicated, since 
one must now decide if the same smoothing process which 
correlates the steps in S also correlates the steps in the bar- 
rier height (and also: if the steps in S were previously inde- 
pendent of those in the barrier height, does the smoothing 
now correlate them?). In triaxial collapse models this is in- 
deed the case (e and p axe defined in the same physical 
volume as S). Hence, the steps in the barrier are correlated 
in the same way as for 5, so the net effect is again to simply 
rescale variances. In particular, the Peacock-Heavens corre- 
lation parameter F is not modified. 

On the other hand, if the stochasticity in the barrier 
height is due to processes on another scale, or to processes 
which have a different correlation structure than S, then F 
will be modified. E.g., if the steps in barrier height are un- 
correlated even when those in S are, then F — > F(l -I- D) 
where D is related to the variance in the barrier height (i.e., 
it also determines the rescaling of S). This is exactly anal- 
ogous to the rescaling k— >■ k/{1 ~\- D) which Maggiore & 
Riotto assume in their treatment of stochasticity. In a pa- 
per which appeared after an earlier version of the present 
work, Corasaniti & Achitouv (2011b) use this argument in 
the treatment of random walks and proceed to fit the re- 
sulting first crossing distribution to a mass function from 
Ai'-body simulations using equation Q, leaving D (and the 
barrier slope) as free parameters and finding agreement at 
the 5% level. We emphasize, however, that this rescaling is 
effectively invoking some unspecified process which operates 
on a scale that is not the same as that on which S was de- 
fined. More importantly, since the ansatz in equation Q 
is most likely incorrect (see above), conclusions about how 
reasonable the fitted value of D is are suspect, given the ob- 
vious differences between Figures 2 and 3 in Sheth, Mo & 
Tormen (2001). And finally, this treatment is motivated by 
the observed scatter in the value of Sc - rather than on theo- 
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retical considerations of triaxial collapse 
a new ambiguity as we discuss next. 



which introduces 



distribution for /3, then 



CI Distribution of deterministic barriers 

The discussion above noted that if the collapse barrier is a 
stochastic function of scale, then this will appear as scatter 
in the critical density at fixed S. But it is important to note 
that the converse is not true: the observation of scatter in 
the critical density at fixed S (e.g. Figures 2 and 3 of Sheth 
et al. 2001) does not imply that the barrier itself is stochas- 
tic. For example, one might find scatter in 5c at fixed S if 
there were simply a distribution of barrier shapes, each one 
of which was deterministic. To illustrate this (trivial) point, 
we consider three simple examples below. In all cases we il- 
lustrate our arguments using walks with uncorrelated steps, 
but note that the extension to correlated steps presents no 
conceptual difference. 



Cl.l Constant barriers 

The first assumes that the first crossing problem of interest is 
always that of a constant barrier; however, the barrier height 
is different for different walks. In this case, the distribution 
of interest is simply 



Sf{S) = / d5cp{5c)Sf{S\5c 



If we use 



p{5c) dSc 



dSc exp(-5^/2A2) 

a: 



A? 



(27r)3/2 



(CI) 



(C2) 



to model the distribution of 5c around a typical value Ac, 
then the integral above can be done analytically: 



SfiS) = - 



2 Ac/VS 



7r(H-Ai/5)2 



(C3) 



This shows that the small S <^ Ac tail is modified dramati- 
cally, from an exponential to a power-law. If /(s) is indeed 
related to halo counts via equation ((4)), then the exponen- 
tially falling tail of halo counts argues against this as being 
the relevant model of stochasticity. 



CI. 2 Linear barriers 

The model above has the same distribution of critical barrier 
heights at all S. A simple model which allows S-dependent 
scatter is to assume that the deterministic barrier is linear, 
5c + PS, but that different walks have different values of /3. 
(This has some merit, since S'-dependent scatter is suggested 
by Figures 2 and 3 of Sheth et al. 2001.) If we use a Gaussian 



SfiS) 



d/3p(/3) SfiSlP) 



d /^ . 



27rE2 



Vs 



1 + T.lS) 



-<52/2S(1+SE|) 



S{1 + ^S) 



27r 



(C4) 



When = then this reduces to the solution for a con- 
stant barrier. So the effect of stochasticity is almost like 
rescaling S — >■ S(l -|- E^S). In this case, the exponential 
cut-off at small S is not modified, but the distribution at 
large 5* is changed dramatically. The first term in the penul- 
timate expression may be thought of as a correction to the 
deterministic (3 = case. This factor is exp((5cE^/2) when 
S <C E|, but it decreases rapidly at large S. Again, if equa- 
tion (U) is correct, then the enhanced counts at small S is 
encouraging, but E/j may have to be rather large for this 
to be a competitive model. Of course, one is free to explore 
other models for p(/3). We will not do so here, because we 
feel we have made our main point - a distribution of deter- 
ministic barriers produces a distribution of critical densities 
at fixed S, and this distribution affects the quantity which 
enters equation Q. 



CI. 3 Square-root barriers 

Before closing our discussion, we think it is worth showing 
explicitly that, by choosing the barrier shapes and their dis- 
tribution appropriately, one can obtain effective first cross- 
ing distributions which closely approximate that for a single 
deterministic barrier with rescaled height (or, alternatively, 
with rescaled variance). This happens to be true if the de- 
terministic barriers have height 5c + P\fS, with a Gaussian 
distribution of /3. This case is tractable, because the first 
crossing distribution of a square-root barrier (of specified 
P) is known. Although the exact solution is complicated 
(Breiman 1966), it is quite well approximated by 

-{S^+PVsf /2S 

Sf{S\P) 



1 + 7^ 
4 



5c e 



5c I s/S ^/27^ 
(Sheth & Tormen 2002; Moreno et al. 2008), so that 



SfiS) 



I 



d/3. 



27rE2 



■Sf{S\P) 



(C5) 



(C6) 



y2 



4(1 + 



5c 



-S^/2S(l+Sj) 



S{1 + ^) 



2n 



This final expression is remarkable, because it shows that 
SfiS) has the same shape as the first crossing distribution 

of a barrier of constant height: i.e., P — and (Jc/yl + E^. 

The constant of proportionality differs from unity, presum- 
ably because the approximation we used for the crossing 
of a square-root barrier is not quite normalized. But even 
for E;3 = 0.65, for which the effective height is the desired 
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\/0.75c, this normalization factor is only 0.925. Therefore, 

we have not bothered to repeat this exercise using Breiman's 
exact expression for the first crossing distribution. 

CI. 4 The potential to test 

We conclude that such models for the stochasticity are po- 
tentially rather interesting. Perhaps more importantly, they 
arc testable. By following essentially the same steps taken 
by Slicth et al. (2001) to produce their Figures 2 and 3, it 
is rather straightforward to test if the stochasticity in 6c at 
fixed mass is indeed due to a distribution of deterministic 
barriers. The same is not true if the barrier height were a 
stochastic function of scale. Until such tests are performed, 
we argue that the question of how to correctly describe the 
observed scatter in barrier heights at fixed mass remains an 
open one. We leave this to future work. 




